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Abstract 

We propose a new method for designing quantum search algorithms for finding a "marked" ele- 
ment in the state space of a classical Markov chain. The algorithm is based on a quantum walk a la 
Szegedy l24l that is defined in terms of the Markov chain. The main new idea is to apply quantum phase 
estimation to the quantum walk in order to implement an approximate reflection operator. This operator 
is then used in an amplitude amplification scheme. As a result we considerably expand the scope of the 
previous approaches of Ambainis |6] and Szegedy ll24ll . Our algorithm combines the benefits of these 
approaches in terms of being able to find marked elements, incurring the smaller cost of the two, and 
being applicable to a larger class of Markov chain. In addition, it is conceptually simple, avoids several 
technical difficulties in the previous analyses, and leads to improvements in various aspects of several 
algorithms based on quantum walk. 

1 Introduction 

1.1 Background 

At an abstract level, any search problem may be cast as the problem of finding a "marked" element from a 
set X with n elements. Let M C X be the set of marked elements. One approach to finding from M, if 
it is not empty, is to repeatedly sample from X uniformly until a marked element is picked. A more cost- 
effective approach re-uses resources expended in generating the first sample (time, random bits, black-box 
queries, etc.) by simulating the steps of a Markov chain with state space X to generate the next sample. This 
approach often takes advantage of some structure present in the ground set X and the Markov chain, and 
leads to a more efficient algorithm. In this article, we study quantum analogues of this randomized scheme. 

Discrete time quantum walks emerged gradually in the field of quantum algorithms (see [24] for a de- 
tailed introduction.) On the line they are related to the quantum cellular automaton model of Meyer lf2ll . 
Watrous [26] introduced quantum walks on regular graphs, and used them to show that randomized log- 
arithmic space is included in quantum logarithmic space. Afterwards notions related to quantum walks, 
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such as mixing time, and deviation from starting state, were studied for restricted graphs by several re- 
searchers |[22l 171131. suggesting the possibility of speed-up of classical algorithms based on random walk. 

Shenvi, Kempe, and Whaley 11231 pointed out the algorithmic potential of quantum walks by designing 
a walk based algorithm to emulate Grover Search lfl6l . The first algorithm using quantum walks that goes 
beyond the capability of Grover Search is due to Ambainis [6] for Element Distinctness. In his seminal 
paper he resolved the quantum query complexity of the problem, settling a difficult question that had been 
open for several years lfT2l [TI. Finally Szegedy |[24l developed a theory of quantum walk based algorithms. 
He designed a quantum search algorithm based on any symmetric, ergodic Markov chain that detects the 
presence of a marked element. He defined a notion of quantum hitting time that is quadratically smaller than 
the classical average hitting time. Since then, in the framework of Ambainis or Szegedy, numerous new 
algorithms emerged with substantially better complexity in a variety of contexts |[5l l20l[T3l[T9ll . 

This work develops a new schema for quantum search algorithms, based on any ergodic Markov chain. 
We extend the Szegedy quantization of classical Markov chains to possibly non-symmetric Markov chains, 
but its use is more in the style of the Ambainis algorithm. Departing from the two algorithms, however, 
we use quantum walks only indirectly. In conjunction with the well known phase estimation algorithm lfl8l 
H51 . the quantum walk helps us implement an approximate reflection operator. This operator may then be 
used within amplitude amplification algorithms lTT6l [lOl [171 for search. As a result, our work generalizes 
previous ones by extending the class of possible Markov chains, and improving the complexity in terms of 
its relation with the eigenvalue or singular value gap of the related Markov chain. In addition, our approach 
is conceptually simple, avoids several technical difficulties in the analysis of the earlier approaches, and 
leads to improvements in various aspects of the algorithms. 

1.2 Two subtly different search algorithms 

The optimal quantum algorithm for Element Distinctness discovered by Ambainis [ 6 ] recasts the problem 
in terms of search for a marked state in a Johnson graph defined by the problem instance. The algorithm 
may be viewed as a quantum analogue of the following search process, where P is the transition matrix of 
a Markov chain defined on state space X. 



Search Algorithm 1 

1. Initialize x to a state sampled from a probability distribution s over X. 

2. Repeat for ti steps 

(a) If the state y reached in the previous step is marked, then stop and output y. 

(b) Else, simulate t\ steps of the Markov chain P starting with the current state y. 

3. If the algorithm has not terminated, stop, and output 'no marked element exists'. 

The parameters t\ and t% in the algorithm are determined by the properties of the Markov chain and the 
marked subset M. The idea behind this algorithm is illustrated by considering an ergodic Markov chain P. 
If t\ is chosen large enough, the state y in step (l2aT ) above will be distributed (approximately) according to 
the stationary distribution of P. Then, the outer loop represents sampling from the stationary distribution 
until a marked element is found. If £2 is chosen to be inversely proportional to the probability that a state is 
marked according to the stationary distribution, the algorithm will succeed with high probability. 

The analysis of the Ambainis quantum algorithm depends heavily on the form of marked states, and 
was presented for subsets M arising out of A;-Collision, a generalization of Element Distinctness, with the 
assumption of a unique collision. Inspired by this algorithm, Szegedy ll24l designed a quantum search 
algorithm with uniform initial distribution, based on any symmetric, ergodic Markov chain. The Szegedy 
algorithm may be viewed as a quantum analogue of a subtly different, but more natural, classical process. 
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Search Algorithm 2 

1. Initialize x to a state sampled from a probability distribution s over X. 

2. Repeat for t steps 

(a) If the state y reached in the previous step is marked, then stop and output y. 

(b) Else, simulate one step of the Markov chain P from the current state y. 

3. If the algorithm has not terminated, stop, and output 'no marked element exists'. 

The parameter t is also determined by the Markov chain P, and the set M of marked states. This 
algorithm is a greedy version of the first algorithm: a check is performed after every step of the Markov 
chain to determine if a marked state has been reached, irrespective of whether the Markov chain has mixed. 
Let us formally derive the complexity of the two algorithms to clarify their differences. Assume that the 
search algorithms maintain a data structure d that associates some data d(x) with every state x G X. From 
d(x), we would like to determine if x G M. When operating with d, we distinguish three types of cost. 

Setup cost S: The cost to sample x G X according to the initial distribution s, and to construct the 
data structure d(x) for the state x. 

Update cost U: The cost to simulate a transition from x to y for a state x G X according to the 

Markov chain P, and to update d(x) to d(y). 

Checking cost C: The cost of checking if x G M using d(x). 

The cost may be thought of as a vector listing all the measures of complexity of interest, such as query and 
time complexity. We may now state generic bounds on the efficiency of the two search algorithms in terms 
of our cost parameters. 

Proposition 1. Let 5 > Obe the eigenvalue gap of an ergodic, symmetric Markov chain P on a state space X 
of size n, and let j^j > e > whenever M C X is non-empty. For the uniform initial distribution s, 

1. Search Algorithm 1 determines if a marked element exists and finds one such element with high 
probability ift\ = 0(|) and ti = O(-). The cost incurred is of order S + ~(|ll + C). 

2. Search Algorithm 2 determines if a marked element exists and finds one such element with high 
probability ift = O(^). The cost incurred is of order S + ^ (U + C) . 

Proof. The stopping time of Search Algorithm 2 is the average hitting time of the set M for the Markov 
chain P. We may therefore take t to be a constant factor more than this hitting time. As mentioned before, 
this time is bounded above by the stopping time for the first algorithm. Therefore part 2 of the proposition 
follows from part 1 . 

In the first algorithm, we may take ^ to be proportional to the average hitting time of the set M for the 
Markov chain P* 1 . The quantity A* (P), the magnitude of the eigenvalue of P with maximum absolute value 
among the eigenvalues different from 1, is bounded by 1 — 6 by hypothesis. The analogous quantity A* (P* 1 ) 
is therefore bounded by (1 — < e~ Stl . Taking t\ = 1/5, we get a spectral gap 5 of at least 1/2 
for P tl . We may now bound the average hitting time of M for P* 1 by, for example, Equation 15 in Ref. ll24ll 
and Lemma 1 in Ref. [11] (also stated as Lemma 10 in Ref. [24]). This bound evaluates to i < -. The 

eo e 

expression for the cost of the algorithm now follows. □ 

For special classes of graph, for example for the 2-d toroidal grid, the hitting time may be significantly 
smaller than the generic bound t = 0(l/fe) given in part 2 (see H Page 11, Chapter 5]). 
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1.3 Quantum analogues 

As in the classical case, the quantum search algorithms look for a marked element in a finite set X, where a 
data structure d is maintained during the algorithm. Let X d be the set of items along with their associated 
data, that is X d = {(x, d(x)) : x E X}. For simplicity we suppose that G X and that d(0) = 0. 

The quantum walks of Ambainis and Szegedy, and also ours, may be thought of as a walk on edges of 
the original Markov chain, rather than its vertices. Thus, that state space is a linear subspace of the vector 
space 7i = C x xX ,or 7id = C Xd x Xd when we also include the data structure. For the sake of elegance in the 
mathematical analyses, our data structure keeps the data for both vertices of an edge, whereas in previous 
works the data was kept alternately only for one of them. There is a natural isomorphism \ip) ^ \ip) d 
between TL and 7i d , where on basis states \x)d = \x, d(x)}. This isomorphism maps a unitary operation U 
on Ti into U d on H d defined by U d \ip) d = (U\ip)) d . 

The initial state of the algorithm is explicitly related to the stationary distribution tt of P. At each 
step, the right end-point of an edge (x, y) is "mixed" over the neighbours of x, and then the left end-point 
is mixed over the neighbours of the new right end-point. We will distinguish again three types of cost 
generalizing those of the classical search. They are of the same order as the corresponding costs in the 
algorithms of Ambainis and Szegedy. Some operations of the algorithms not entering into these costs will 
not be accounted for. This is justified by the fact that in all quantum search algorithms the overall complexity 
is of the order of the accounted part expressed in the specific costs. 

(Quantum) Setup cost S: The cost for constructing the state J2 X \J^x\ x ) d\0) d 

(Quantum) Update cost U : The cost to realize any of the unitary transformations and inverses 

\x) d \o)d ^ \x)dY,yVp^\y)d, \o)d\y)d ^ T,x\/^\ x )d\y)d, 

where P* = (p xy ) is the time-reversed Markov chain defined by the equations TT x p xy = n y p* x . 
(Quantum) Checking cost C: The cost to realize the unitary transformation that maps \x) d \y)d to 
— \x) d \y)d if x S M, and leaves it unchanged otherwise. 

The quantum search algorithms of Ambainis and Szegedy give a quadratic speed up in the times t±, ti 
and t, with respect to the classical algorithms. 

Theorem 1 (Ambainis [6]). Let P be the random walk on the Johnson graph on r -subsets of a universe of 
size m, where r = o(m). Let M be either empty, or the class of all r subsets that contain a fixed subset 
of size a constant k < r. The eigenvalue gap of P is 5 G ^(-), and the fraction of marked elements is 
either or e = f2(-^). Then, there is a quantum algorithm that determines, with high probability, if M is 
non-empty or finds the k-subset with cost of order S + -^=(-^U + C). 

Theorem 2 (Szegedy [24]). Let 5 > Obe the eigenvalue gap of an ergodic, symmetric Markov chain P, and 
let j^j > e > whenever M is non-empty. There exists a quantum algorithm that determines, with high 
probability, if M is non-empty with cost of order S + "^j(U + C). 

If the checking cost C is substantially greater than that of performing one step of the walk, an algo- 
rithm with the cost structure of the Ambainis algorithm would be more efficient. Moreover, the algorithm 
would find a marked element if one exists. These advantages are illustrated by the algorithm for Triangle 
Finding 11201 . This algorithm uses two quantum walks a la Ambainis recursively; the Szegedy framework 
seems to give a less efficient algorithm. Nonetheless, the Szegedy approach has other advantages — it applies 
to a wider class of Markov chain and for arbitrary sets of marked state. Moreover, the quantity 1 / y/Se in 
Theorem |2] may be replaced by the square-root of the classical hitting time 11241 . These features make it 
more suitable for applications such as the near-optimal algorithm for Group Commutativity fT9l which has 
no equivalent using the Ambainis approach. 
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1.4 Contribution, relation with prior work, and organization 

Our algorithm is a quantum analogue of Search Algorithm 1 and works for any ergodic Markov chain. It 
is most easily described for reversible Markov chains. 

Theorem 3 (This paper). Let 5 > be the eigenvalue gap of a reversible, ergodic Markov chain P, and 
let e > be a lower bound on the probability that an element chosen from the stationary distribution of 
P is marked whenever M is non-empty. Then, there is a quantum algorithm that with high probability, 
determines if M is empty or finds an element of M, with cost of order S + -^=(-^=U + C). 

Our algorithm considerably expands the scope of the approaches embodied in Theorems Q] and |2] above. 
It combines the benefits of both approaches in terms of being able to find marked elements, incurring the 
smaller cost of the two, and being applicable to a larger class of Markov chain. In addition, it is concep- 
tually simple, avoids several technical difficulties in the analysis of the earlier approaches, and leads to 
improvements in various aspects of algorithms for Element Distinctness, Matrix Product Verification, Tri- 
angle Finding, and Group Commutativity. Namely, we give a single-shot method for any algorithm a la 
Ambainis in presence of multiple solutions, without the need for a reduction to special cases such as that 
of a unique solution. This applies to Element Distinctness and Triangle Finding. For Element Distinctness, 
Matrix Product Verification, and Group Commutativity, where an algorithm a la Szegedy only detects the 
existence of a solution, we find one with the same time and query complexity. Finally, we improve the 
complexity of the best previous known algorithm for Triangle Finding from 0(n 13 polylog(n)) to 0(n 13 ). 

In Section |2l we describe a quantum analogue of a Markov chain based on the work of Szegedy ll24l 
who defined such a quantum process W(P,Q) for a classical bipartite walk (P,Q). By letting Q = P, 
he related the spectrum of the quantum walk W(P) to that of P for symmetric Markov chains. Using an 
absorbing version of P as in Search Algorithm 2, he designed a quantum analogue of this classical scheme. 
Even when P is not symmetric, letting Q = P* , the time-reversed Markov chain corresponding to P, leads 
to a natural connection between P and W(P). If P is reversible, then the eigenvalues of W(P) are closely 
related to those of P, as in the symmetric case. For an arbitrary, possibly non-reversible, ergodic Markov 
chain, this connection relates the eigenvalues of W{P) to the singular values a "discriminant" matrix D(P) 
associated with P. 

In Section |3l we use the quantum walk W(P) with the unperturbed walk P in a completely different 
way, more in the style of the Ambainis approach. Ambainis directly uses a power of W(P) to replace the 
"diffusion" operator in the Grover search algorithm. The beauty of this step, and the difficulty of proving 
its correctness, lies in the fact that even if no power of W{P) closely approximates the diffusion operator, 
some powers have sufficient properties to mimic its essential features. We introduce a novel way to approx- 
imate the diffusion operator. Our approach is both conceptually simpler, and more general. We observe 
that W(P) amplifies the spectral gap of a reversible Markov chain quadratically. We translate this to an 
efficient approximation to the Grover diffusion operator (Theorem [6]), using the well known phase estima- 
tion algorithm. We then begin an exposition of our algorithm by considering reversible Markov chains. To 
explain the basic idea of our approach, we first prove our main result with an additional logarithmic factor 
(Theorem 0). 

In Section 01 using a technique developed by H0yer, Mosca, and de Wolf ifTTl we show how to eliminate 
the logarithmic factor in the previous theorem, thus proving Theorem [3] 

In Section [5] we extend the algorithm to a possibly non-reversible Markov chain whose discriminant 
D(P) has non-zero singular value gap (Theorem [9]). The complexity of the algorithm in the general case is 
similar to the one for reversible Markov chains. The sole difference is that the singular value gap of D(P) 
takes the place of the spectral gap of P. While the eigenvalues of Markov chains are well studied, we are 
not aware of a similar theory for singular values of this matrix. Nonetheless, we believe that such a general 
result may prove useful for future applications. 
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For simplicity, in our algorithms we assume that the number of marked elements is known in advance. 
This assumption may be removed using standard techniques, without increasing the asymptotic complexity 
of the algorithms O. 

2 Quantum analogue of a classical Markov chain 

Let P = (p xy ) be the transition matrix of any irreducible Markov chain on a finite space X with \X\ = n. 
By the Perron-Frobenius theorem, the chain has a unique stationary distribution ir = (tt x ), that is a unique 
left eigenvector tt with eigenvalue 1 and positive coordinates summing up to 1. Since we will only consider 
irreducible Markov chains in this paper, from now on we may omit the term 'irreducible' in our discussions. 
We will also often identify a Markov chain with its transition matrix P. 

Let P* = (py X ) be the time-reversed Markov chain defined by the equations ir x p xy = Tt y p yx . We will 
define a quantum analogue of an arbitrary irreducible Markov chain P, based on and extending the notion of 
quantum Markov chain due to Szegedy [24]. The latter was inspired by an earlier notion of quantum walk 
due to Ambainis [0. We also point out that a similar process on regular graphs was studied by Watrous [26]. 

For a state \tp) G TL, let IL, = denote the orthogonal projector onto Span(|^)), and let ref(tp) = 

2IL^, — Id denote the reflection through the line generated by \tp), where Id is the identity operator on TL. If 
K, is a subspace of TL spanned by a set of mutually orthogonal states {\ipi) : i G /}, then let Hfc = Yliei H^i 
be the orthogonal projector onto /C, and let ref (/C) = 211k; — Id be the reflection through K. 

Let A = Span(\x)\p x ) : x G X) and B = Span(|p*)|y) : y G X) be vector subspaces of TL = C XxX , 
where 

\Px) = J2 y ex V^v\y) and H) = Sxex \/Py~x~\ x )- 

Definition 1 (Quantum walk). The a unitary map W{P) defined on TL by W{P) = ref (B) • ref (A) is called 
the quantum walk based on the classical chain P. 

This quantum walk extends to a walk W{P)d on the space TL augmented with data structures, as ex- 
plained in Section [131 Recall that U is the quantum update cost as defined in the same section. 

Proposition 2. The quantum walk with data W(P)d can be implemented at cost 4U. 

Proof. The reflection ref(^4)<j is implemented by mapping states \x}d\p x )d to \x)d\0)d, applying ref(TLd ® 
\0)d), and undoing the first transformation. In our accounting we charge unit cost for the second step since 
it does not depend on the database. Therefore the implementation of ief(A)d is of cost 2U. The reflection 
ref (B) may be implemented similarly. □ 

The eigen-spectrum of the transition matrix P plays an important role in the analysis of a classical 
Markov chain. Similarly, the behaviour of the quantum process W(P) may be inferred from its spec- 
tral decomposition. For this, motivated by Szegedy lF24l . we consider the discriminant matrix D(P) = 
(y/PxyPyx)- Since ^p xy p yx = \f : K x ~p X yl ' sfKy, the discriminant matrix is equal to 

D(P) = diag(vr) 1 / 2 •P-diag(vr)- 1 /2 ) 

where diag(-7r) is the invertible diagonal matrix with the coordinates of the distribution it in its diagonal. 
Since the singular values of D(P) all lie in the range [0, 1], we may express them as cos 8, for some an- 
gles 6 G [0, w]. For later reference, we rewrite Theorem 1 of Szegedy [24 ] which relates the singular value 
decomposition of D(P) to the spectral decomposition of W(P). 

Theorem 4 (Szegedy [24]). Let P be an irreducible Markov chain, and let cos 9\, . . . , cos 9[ be an enumer- 
ation of those singular values (possibly repeated) of D(P) that lie in the open interval (0, 1). Then: 
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1. On A+B those eigenvalues ofW(P) that have non-zero imaginary part are exactly e 1 , . . . , e l , 
with the same multiplicity. 

2. On AnB the operator W(P) acts as the identity Id. Ad B coincides with the left (and right) singular 
vectors of D(P) with singular value 1. 

3. On An B 1 - and A 1 - n B the operator W(P) acts as —Id. A n B (respectively, A x D B) coincides 
with the set of left (respectively, right) singular vectors of D(P) with singular value 0. 

4. W(P) has no other eigenvalues on A + B; on A 1 - Pi B 1 - the operator W(P) acts as Id. 

We define A(P), the phase gap of W(P) as 26, where 6 is the smallest angle in (0, |) such that cos 9 
is a singular value of D(P). This definition is motivated by the previous theorem: the angular distance of 1 
from any other eigenvalue is at least A(P). 



3 From quantum walk to search 

3.1 Outline of search algorithm 

We now describe a search algorithm that may be viewed as a quantum analogue of Search Algorithm 1 of 
Section [PI Consider the following quantum state in the Hilbert space TL: 

K) =£xexV 5 ^l iB >b*> =£ y eX\/%K>l2/>- 
This state will serve as the initial state for our algorithm, and corresponds to starting in the stationary dis- 
tribution 7r in the classical search algorithms. Assume that M^0. Let M. = C MxX denote the subspace 
with marked items in the first register. We would like to transform the initial state |vr) to the target state \p,}, 
which is the (normalized) projection of \ir) onto the "marked subspace" M.: 

\v) = ||n^K)|| = 7? £zeA/ ■s/^\x)\p x ), 
where e = ||IL>t |7r) || 2 = 2~2 X ^M n % * s tne probability of a set M of marked states under the stationary distri- 
bution 7T. Roughly speaking, we will effect this transformation by implementing a rotation a la Grover lfl6l 
in the two-dimensional real subspace S = Span(|-7r), |/u)) generated by the states. 

Ideally, we would like to effect the rotation ref(7r)d • ief(p ± )d in Sd, where l/i -1 ) is the state in S 
orthogonal to which makes an acute angle with \ir). The angle ip between \ir) and \p>' L ) is given 
by sin ip = (^|vr) = sfs. The product of the two reflections above is a rotation by an angle of 2tp within the 
space S. Therefore, after 0(1/^) = 0(1/^) iterations of this rotation starting with the state \ir), we will 
have approximated the target state 

Restricted to the subspace S, the operators ref( / u _L ) and — ref(.M ) are identical. Therefore, if we ensure 
that the state of the algorithm remains close to the subspace S throughout, we would be able to implement 
ref (p, 1 - )d- it involves checking at cost C whether an item in the first register is marked. 

The reflection ref (-7r)d is computationally harder to perform. The straightforward strategy would be to 
rotate | vr) ^ to the state |0)rf|0)rf, use ref (|0)d|0)d), and then undo the first rotation. However, rotating | vr) ^ 
to |0)d|0)d is exactly the inverse operation of the preparation of the initial state 1 7r)^ from |0)d|0)d, and 
therefore requires the same cost S + U. This may be much more expensive than the update cost 4U incurred 
by the walk W(P)d- To use therefore W(P)d, our idea is to apply phase estimation to it, and exploit this 
procedure to approximate the required diffusion operator on Ad + Bd which contains the subspace Sd- 

3.2 Diffusion operator based on quantum walk 

To explain our approach, in the rest of this section, and in the next one, we assume that the classical Markov 
chain P is ergodic and reversible. Let us recall that a finite Markov chain is ergodic if it is irreducible and 
aperiodic. The Markov chain P is said to be reversible if P = P* , where P* is the time reversed chain. For 
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a reversible chain the corresponding discriminant D{P) is symmetric. Symmetry implies that the singular 
values of D(P) equal the absolute values of its eigenvalues. Since D(P) = diag(vr) 1 / 2 • P ■ diag(7r) -1 / 2 
is similar to the matrix P, their spectra are the same. Therefore, we only study the spectrum of P. The 
Perron-Frobenius theorem and the ergodicity of P imply that the eigenvalue 1 is the only eigenvalue of P 
with absolute value 1 and has multiplicity 1. The corresponding eigenvector of D{P) is (y^), and every 
singular (or eigen-) vector of D{P) orthogonal to this has singular value strictly less than 1. Transferring 
this property to the quantum walk W(P) via Theorem HI \ir) is the unique eigenvector of the unitary opera- 
tor W(P) in A + B with eigenvalue 1, and the remaining eigenvalues in A + B are bounded away from 1. 
We use this observation to identify the component of any state \ip) G <S perpendicular to \n). 

The main idea in our implementation of the above approach is to use phase estimation [TU [15] . 

Theorem 5 (Phase estimation, Cleve et al. B15I0 . For every pair of integers m,r > 1, and a unitary opera- 
tor U of dimension mxm, there exists a quantum circuit C(U) that acts onm+s qubits, where s = r + 0(1) 
and satisfies the following properties: 

1. The circuit C(U) uses 2s Hadamard gates, 0(s 2 ) controlled phase rotations, and makes 2 S+1 calls 
to the controlled unitary operator c—U. 

2. For any eigenvector ofU with eigenvalue 1, i.e., ifU\ip) = \ip), then C(U)\iIj)\O s ) = \ip)\O s ). 

3. IfU\ip) = e lS \ifj), where 6 £ (0, 2tt), then rounding off the number in the last s qubits of the state 
C(U)\ip) \0 S ) to the r most significant bits gives the best r-bit approximation to 9/2tt with probability 
at least 2/3. 

Moreover the family of circuits C parametrized by m and s is uniform. 

In the following theorem we show how phase estimation can be used to design a quantum circuit R(P) 
which implements an operation that is close to the reflection ref(-7r). 

Theorem 6. Let P be an ergodic Markov chain on a state space of size n, and let m = n 2 . Then there is a 
constant c > such that for any integer k, there exists a quantum circuit R(P) that acts on m + ks qubits, 
where s = log( ^(p) ) + 0(1), and satisfies the following properties: 

1. The circuit R(P) uses 2ks Hadamard gates, 0(ks 2 ) controlled phase rotations, and makes at most 
k 2 S+1 calls to the controlled quantum walk c— W(P), and its inverse c— W(P)^. 

2. If\ir) is the unique 1-eigenvector of W (P) as defined above, then R(P)\ir)\O ks ) = \ir)\O ks ). 

3. If\t/j) lies in the subspace of A + B orthogonal to \ir), then \\(R(P) + Id)|V>)|O fcs )|| < 2 1 ~ ck . 
Moreover the family of circuits R(P) parametrized by n and k is uniform. 

Proof. We describe the circuit R(P). We start by applying the phase estimation theorem to the quantum 
walk W(P), a unitary operator of dimension mxm. Then using standard methods, we amplify the success 
probability of the resulting algorithm. We repeat it k times and take the median of the answers, increasing 
the success probability to 1 — 2~ ck , for some constant c > 0. Observe that only the number of ancillary 
qubits increase from s to ks in this process; we do not need additional copies of the eigenvector \ip). 

This approximately resolves any state \ip) in A + B along the eigenvectors of W(P) by labeling them 
with the corresponding eigenvalue phases. We now flip the phase of all states with a non-zero estimate of 
the phase, that is flip the phase of all eigenvectors other than | it) . Finally, we reverse the phase estimation. 

The state | -7r) 1 fcs ) stays unchanged. When is orthogonal to \n) then it is the linear combination 
of eigenvectors of W{P) whose eigenvalues are of the form e ±2ie ', where A(P)/2 < < ir/2. Since 
20/2-k > A(P)/8,the (log(l/A(P))+3)-bit phase estimation is non-zero with probability at least l-2~ ck . 
We can therefore decompose \^}\0 ks ) into the sum \ipo) + \ipl)> where the phase estimation is zero on IV'o}* 
non-zeroon|V>i>,and|||^o>|| < 2~ cfe . Then R(P)\ ^)|0 fcs ) = |^o)-|^i), and (i?(P)+Id)|V')|0 A;s ) = 2|^o) 
whose norm is at most 2 1_cfc . □ 
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3.3 The search algorithm for reversible Markov chains 

Let us consider the following quantum procedure. 



Quantum Search (P) 

1. Start from the initial state jvr)^. 

2. Repeat 0(l/y / e)-times: 

(a) For any basis vector \x)d\y)d, flip the phase if x S M. 

(b) Apply circuit R(P) d of Theorem [6] with k = 0(log(l/ v / e)). 

3. Observe the first register. 

4. Output x if x £ M, otherwise output 'no marked element exists'. 



Theorem 7. Le? 5 > be the eigenvalue gap of a reversible, ergodic Markov chain P, and let e > be 
a lower bound on the probability that an element chosen from the stationary distribution of P is marked 
whenever M is non-empty. Then, with high probability, the procedure Quantum Search (P) determines if 

~(t^7?) u + C • 



M is empty or finds an element of M with cost of order S + 



Proof. For the mathematical analysis we reason in the Hilbert space TL, without the data stiuctures, and also 
suppress the ancilla qubits used by the circuit R{P). We know that in the two-dimensional subspace S = 
Span(|7r), \p)), the Grover algorithm, consisting in 0(1/ y/e) iterations of ref (tt) ■ ref (// ), turns the vector 
\ir) into a state whose inner product with \p) is 1 — 0( v / e). Using a hybrid argument as in (H|25l, we prove 
that the algorithm Quantum Search (P) simulates, with an arbitrarily small error, the Grover algorithm, 
and therefore finds a marked element with high probability, whenever such an element exists. 

For i > 0, we define \4>i) as the result of i Grover iterations applied to \tt), and \ipi) as the result of i 
iterations of step (2) in Quantum Search (P) applied to \tt). We show by induction on i, that — \<j)i) || < 
i2 1 ~ ck , where c is the constant of Theorem[6] Indeed, we can write \ipi) as \4>i) + (\ipi) — \4>i))- The action 
of ref(/i _L ) and — xei(M) are identical on \4>i) since it is in S. Set |r) = \4>i+\) + R(P) ■ ref(A4)\4>i). 
Since Tei(Ai)\<pi} is in S, and S is a subspace of A + B, conclusion (3) of Theorem |6]can be applied, which 
implies that |||r)|| < 2 1 ~ ck . Using that IHV'i+i) ~ ^ lll r )ll + IllV'i) - |<^i)||> the statement follows. 

This implies that \\\tpk) — \<j>k)\\ i s an arbitrarily small constant when k is chosen to be 0(c _1 log(l/y / e)). 

Let us now turn to the cost of the procedure. Initialization costs S + U, and in each iteration the single 
phase flip costs C. In the circuit R(P)d, the controlled quantum walk and its inverse can be implemented 
with four update operations, each of cost U. Indeed, the implementation of W(P), described in the proof 
of Proposition [2] works also for the controlled quantum walk if we replace xef (TLd <8> |0)^) by the controlled 
operator c— ref (TLd <8> \0)d)- Since the controlled reflection is also of unit cost, this change does not alter the 
cost of the implementation. 

In R(P)d the number of controlled quantum walks and its inverse is 0((1/A(P)) log^/y^)). We claim 
that A(P) = £1(^/5). Let Ao, A n _i be the eigenvalues, with multiplicity, of P such that 1 = Ao > |Ai| > 
■ ■ ■ > |A n _i|. Since the discriminant D(P) is similar to P, their spectra are the same, and therefore the 
singular values of D(P) are | Ao| , | Ai| , . . .. By definition, A(P) = 26\, where cos 9\ = \\\\. The following 

3 2i6»i I 



(in)equalities can be easily checked: A(P) > 
analysis. 



1 



2 Jl - | Ai| 2 > 2yj5(P). This finishes the cost 



□ 



Let us observe that the origin of the quadratic speed-up due to quantum walks may be traced to the 
quadratic relationship between the phase gap A(P) of the quantum walk W(P) and the eigenvalue gap 8 of 
the classical Markov chain P, observed at the end of the previous proof. 
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4 Search with approximate reflection operators 



In this section, we describe how our approximate reflection operator may be incorporated into a search 
algorithm without incurring additional cost for reducing its error. The basic idea is to adapt the recursive 
amplitude amplification algorithm of H0yer, Mosca, and de Wolf ifTTTl to our setting. To describe it, we 
use the notation from Section 13.11 where we discussed how the Grover algorithm works to rotate a starting 
state \ir) into a target state where (h\tt) = sin ip = yfe. We define procedures Ai recursively. Let the 
procedure A$ be the identity map Id, and for i > 0, let 

Ai = Ai-i ■ ref (tt) • A\_ x ■ ref^ 1 ) • A^. 

We define the states |7Tj) as Aj|7r). Then forms an angle 3 i ip with l/U- 1 ), and therefore the state \ir t ) is 
close to when t = log 3 ^ + 0(1). The final recursive algorithm is thus At. 

We may estimate the cost Cost(i) of this search algorithm in terms of the cost c of implementing the 
two original reflections. We have Cost(O) = 0, and for i > 1, Cost(i) = 3 • Cost(z — 1) + c, and therefore 
the cost of At is 0(c/ ' yfe). 

The recursive amplitude amplification algorithm is more suitable for situations where we have imperfect 
procedures that implement the basic reflections ref (tt), ref (fJ. ± ). H0yer et al. ifTTl demonstrated this when 
there is an ideal (error-free) procedure for ref(-7r), and a procedure for ref(^ _L ) that has ideal behaviour 
only with high probability. We extend this to the case where the first reflection may also be imperfect. In 
the context of quantum walk based search, an imperfection appears in the form given by Theorem [6l The 
basic idea is to create an analogue of the recursive algorithms Ai when ref (tt) is replaced by increasingly 
fine approximations based on Theorem [6] For the sake of simplicity, we only deal with the case when the 
implementation of ref (/x -1 ) is ideal. 

Theorem 8. Let | vr) be some state in a Hilbert space TC, let M C TC be a subspace ofTC, and let e = 
||nA/!|7r)|| 2 , where ILx is the projector onto M.. Assume that for any (3 > 0, we have a quantum circuit 
R(f3) acting on TC (g) K,, where K, is an extra register of s qubits (s = may depend on j3), with the 
following properties: 

1. The circuit R(f3) has a cost of order 0(ci log ^). 

2. i?(/?)|7r)|0*) = |7r)|0 s ). 

3. If\ip) is orthogonal to |vr), then \\(R(P) + Id)|V>}|O s ) || < 13. 

Further, assume that we are able to apply — ref(A4) with a cost 0(c2). Then, for any 7 > 0, there exists a 
quantum circuit that maps \ir) to a state that has projection of length at least — 7 in M, and incurs a 

cost ofO (± ■ (ci log ^ + c 2 ) 



Proof. Let t be the smallest non-negative integer such that 



3* sin" 1 Ve G 



TT 3TT 

4'T 



We have t = log 3 4g + 0(1). The quantum circuit consists in t recursive amplitude amplification steps and 
acts on TC (g) [®- =1 /Cj] , where /Q is an extra register used at step i. Let s« = s(/3j) be the size of register 
/Cj, where (3i will be specified later. Let 5 = Yl\=! s i- 

The quantum circuit follows exactly the recursive amplitude amplification algorithm explained in Sec- 
tion HI We use 1 0o ) = \4>) = l 71 ")^ 5 ) as tne initial state and essentially replace ref(0) at step i by an 
approximation R((3i), acting on TC (g> /Cj, and Id on the rest. The precise definition of the algorithm Ai in 
terms of the reflection operator Ri used in this recursive step is as follows. Let Aq = Id. For i > 1: 
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Approximate reflection Ri 

The algorithm acts on H (g> 
it 



1. Apply 

2. If the registers corresponding to ®*- =1 K,j are in state |0 S ), apply R{fa) on < 

3. Otherwise, reflect through the origin, i.e., apply —Id. 

4. Apply Ai-L 



Further, define 

At = Rt ■ ref(M) • A-i 
|&) = Ai |0 O > 

Since is an approximation to ref(0 o ), we see that is an approximation to ref (<^_i). Let Ei 

Ri — ref (0i_i) be the error made in our implementation of ref (</>i-i). 

Fact 1. £*j satisfies the following properties: 

1. Ej\<t>i_i) = 0, awJ 

2. < A V |V> -L \<f>i-i). 



To analyze this algorithm, we keep track at all steps of the projection of on the marked subspace. 



The marked subspace corresponds to Ai < 



; it consists of states in which the H-part is marked. For 

ease of notation, we will denote this space by M. Define the normalized projections of on the marked 
subspace M and on its orthogonal complement as: 



n 



M\ 



\\^m\</h)\\ 
(id-n^)|^) 
||(id-n^)|^)| 



We thus have 



\(j>i) = sirnpi \m) + cos^j 



(1) 



where sin 2 tpi = ||ILvf || 2 is the success probability at step i, that is the probability of finding a marked 
item by measuring the first register according to {rLvf, Id — rLvf}. For later use, let us also define the state 
\<j)j-) as the state below in the 2-dimensional subspace spanned by and \p,j~) that is orthogonal to \cf>i): 

\<t>t) = cosfi \m) -simpi \n±). 

For the initial state \(j)o), we have sin 2 ipo = e, by the hypothesis of the theorem. If all the errors fa 
were zero, our recursive algorithm would implement an amplitude amplification in the subspace spanned by 
\Hi) = \hq) and \fij-) = \/j,q), with the angles (fi+i = Stpi, that is tpi = 3Vo- Therefore by recursively 
iterating our procedure for a total number of t steps, we would end up with a state whose inner product with 
|/xo) is at least 
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Analysis of the errors. We show that the algorithm still works when the errors are sufficiently small. 
In that case, the 2-dimensional subspace {|/Uj), )} may drift away from the initial subspace {|/Uo), \hq )}, 
and the angles (pi may be different from the ideal value ipi = 3 l </?o- We derive bounds on the error ef 

ei = \sin(fi - sin<f>i\ , (2) 

the difference between the amplitude sin ipi of the marked part of the state \<fii) and the ideal ampli- 
tude, sin<^i. 

We assume without loss of generality that < 7 < since the case 7 > is vacuous. We prove that 
after t steps et < 7. This will conclude the error analysis since ^= < sin <p t < 1. 
We have 

|&+i> = Ri+i • ?ef(M) Ifa) 

= ref(^) • ref(M) |&) + E i+1 ■ ref(TW) |&) 

= sin3(^j + cos3<^ + l^i+i), (3) 

where we used the fact that ref(<fo) • ref(A^) implements a perfect amplitude amplification step, and we 
introduced an error state \u>i), defined as 

\u) i+ i) = E i+1 ■ ref(M) \<f>i) 

= E i+ i ■ ref(M) (simpi \m) + cos (pi \fj,j 

= E i+ i ^-sin<^j \m) + cosifi \i4 

= E i+ i {cos 2ifi \<f>i) - sin 2tfi \<f>j 
= -sm2<pi E i+ i\cf)l}, 

where we used Fact[TJ property [TJ Moreover, \4>f) _L \4>i), so |||u;j + i)|| < (3i + \ sin 2^. Finally, comparing 
Eq. © and ©, 

I sin (fi+i — sin 3(pi \ < sin 2ipi . 

We may now bound the error defined in Eq. © as: 

< I sin (fi+i — sin 3ipi | + | sin 3p>i — sin <p~i+i \ 

< sin 2ip.i + I sin 3tpi — sin 3(f>i\ 

< f3 i+ i (sin 2(pi + I sin 2(pt — sin 2(pi | ) + | sin 3ipi — sin 3(p>i \ 

< (sin 2(p,i + 2ei) + 3ej 

< 2fl i+1 (<pi + ej) + 3ej, 

where we have used the triangle inequality and the following trigonometric inequalities 

I sin 2A - sin 2B\ < 2\ sin A - sin B\ 
I sin 3A — sin 3B \ < 3| sin^4 — sini?| 
sin A < A 

that hold for A, B e [0, vr/4]. 

We define a quantity ej, intended to be an upper bound on ej (this would hold if ej < (p>i). Let 

e = 
e l+ i = 4:(5 i+ i(pi + 3e"j. 
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We show that e, < 7 for every i < t, if we take so that they define a convergent series. Indeed, let us 
define Uj as 

ei = ifiiUi. 
We therefore have the following recursion for ui 

uq = 

4 

u.j + i = Ui + — (Vz > 0) 

37 

so that 

4 * 
i=i 

Since we have assumed that {/%} define a convergent series, the non-decreasing sequence (ui) tends to some 
real when i —> 00. We therefore have < ^fu^ipi. 

We now set /3j = ^j/i 2 - Therefore = 1/tt and we have efj < 7^j/vr < 7 since < (pt < tt for 
£ < i. 

Since < 7 < 1, we have ej < and we can show by induction that ej < ej for all i < t. This 
finishes the error analysis. 

Complexity. We now evaluate the complexity of our algorithm. We know from the hypotheses of the 
theorem that applying R{(3i) has a cost c\ log Jr, while applying ref(.M) has a cost Q. Moreover, we see 
from Eq. (Q]) that applying Ai requires 3 calls to A^x, one call to R{[3i) and one call to ref(A^). Hence, if 
we denote Cost(?) the cost of applying Ai, we have 

Cost(0) = 

Cost(i) = 3 Cost(i - 1) + ci log ^- + c 2 . 

Pi 

Since we have fixed /3j = -^zj/i 2 , we find 

Cost(i) = (c 1 [21og- + 0(l)))^3^(logj)+c 2 ^3 i ^ 

W +0(1) ))£^ +C2 £i 

3=1 j=i 

where both sums converge as i — > 00. After i steps we have Cost(i) = O • (ci log ^ + C2 

If the cost refers to time complexity, then there is an additional term pertaining to the reflection 
This arises from the check to see if the ancilla used are in state \0 S ). This does not change the asymptotic 
complexity of the algorithm. □ 

We now have all the elements to prove Theorem [3] 

Proof of Theorem^} The algorithm consists in the recursive quantum search from Theorem [8j using for the 
approximate reflections R{j3) the quantum phase estimation circuit R{P) from Theorem [6] Setting k = 

((log 4) + l)/c in Theorem[6l R{P) simulates a reflection with an error upper bounded by 2 1 ~ ck < (3. 

Implementing R{P)d then requires k 2 S+1 calls to the controlled quantum walk c— W(P)d or its inverse, 
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where s = log(^=) + 0(1). Since implementing c— W(P)d or its inverse has a cost 4U, the cost of 
implementing the circuit R(P)d for a given error (3 is c\ log ^, with c\ of order A= U. Furthermore, preparing 
the initial state jvr)^ has a cost S + U, and implementing — ref {M)d nas a cost C2 = C. Therefore total cost 
of the recursive search algorithm from Theorem [8] is of order S + -W4=U + C). □ 



5 Non-reversible Markov chains 



In this section, we discuss the performance of the search algorithm presented earlier for any ergodic, but 
possibly non-reversible Markov chain P. For the analysis of the quantum walk W(P) we directly examine 
the singular value decomposition of the discriminant matrix D(P) = diag(vr) 1 / 2 • P ■ diag(-7r) -1 / 2 . This 
matrix has the same eigenvalues as P, but the singular values of D{P) may be different from the eigenvalues 
of P. The singular values of D{P) lie in the interval [0, 1]. The vector v = (y^) is both a left and a right 
eigenvector of D{P) with eigenvalue 1. Therefore, Span(t>) and Span(u)- 1 - are invariant subspaces of D(P), 
and we may choose v to be a left and right singular vector. If every singular vector orthogonal to v has a 
singular value strictly smaller than 1, that is D(P) has a non-zero singular value gap, then Theorem [3] and 
its proof stay valid when the eigenvalue gap of P is replaced by the singular value gap of D(P). 

The discriminant of an irreducible walk does not necessarily have non-zero singular value gap, even if 
it is ergodic. Ergodicity implies a non-zero eigenvalue gap for P, but there are examples of ergodic Markov 
chain whose discriminant has singular value gap. In the next proposition we show that if for every state, 
the Markov has a transition to itself with non-zero probability, then its discriminant has non-zero singular 
value gap. There is a standard and simple modification to any Markov chain P such that the resulting chain 
has this property: with some probability a G (0, 1), stay at the current state, and with probability 1 — a, 
make a transition according to P. 

Proposition 3. Let P = (p xy ) be an irreducible Markov chain on a finite state space X, such that p xx > 0, 
for every x £ X. Then, the discriminant matrix D(P) has exactly one singular value equal to 1. 

We first state and prove that all the singular values of D(P) lie in [0, 1]. 

Lemma 1. Let P = (p X y)x,yeX be an irreducible Markov chain with stationary distribution tt = (tt x ) X £X- 
Then the singular values of the matrix D(P) given by 

D(P) = diag(vr) 1/2 • P ■ diag(vr)- 1/2 

all lie in the interval [0, 1]. 

Proof. Singular values are by convention taken to be non-negative real. To verify that ||D(P)||, the largest 
singular value of D(P) is at most 1, consider the inner product u Jf D(P)v, for some unit vectors u, v. The 
maximum absolute value that this inner product achieves is the norm of D(P). By the Cauchy-Schwartz 
inequality, the inner product may be bounded as 




< 1, 



since J2 X n x p xy = vr y . □ 
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Proof of Proposition^ From Lemma [T] we know that the singular values of D(P) all lie in [0, 1]. Fur- 
ther v = (y/Ttx) is a left (and right) singular vector with singular value 1. We show below that for any left 
and right singular vectors u,w € C x , if v) D{P) w = 1, then u = w = v (modulo an overall phase). This 
establishes the uniqueness of the singular value 1 and a non-zero singular value gap in D(P). 

Suppose «t D(P) w = 1. This implies that the Cauchy-Schwartz inequality in Equation (0]) in the proof 
of Lemma Q] is tight. Then necessarily, the two unit vectors u',w' G C XxX given by u' = (u x ^/p^) X:y& x 
and w' = (w y y 7 ir x p xy / TT y ) x ,y&x are equal. This means that for every pair x,y £ X such that p xy > 0, 
Ux = w y \pKxpKy. In particular, since p xx > 0, u x = w x for every x, and so u x = u y ^j 'n x /TT y for every 
neighbour y of x in the graph underlying the Markov chain P. 

Furthermore, for any path x\, X2, . . . , Xk in the graph, chaining together the equations 



for i = 1, . . . , k — 1, we get that 



*xi 



7T, 



for every i. Since the chain P is irreducible, i.e., the underlying graph is strongly connected, there is path 
from x\ to y for every y G X. Thus, 



7T, 



for every y. Since the vector u is a unit vector, this implies that u = w = (y/T^) x& x = v (up to an 
unimportant global phase). □ 

Let us finally state the theorem on our quantum search algorithm when the underlying Markov chain is 
not necessary reversible. 

Theorem 9. Let P = (p xy ) be an irreducible Markov chain on a finite state space X, such that D(P) has 
exactly one singular value equal to 1. Let 5 > be the singular value gap of D(P), and let e > be 
a lower bound on the probability that an element chosen from the stationary distribution of P is marked 
whenever M is non-empty. Then, there is a quantum algorithm that determines, with high probability, if M 
is empty or finds an element of M with cost of order S + -^(-^U + C). 



6 Conclusion 

We proposed a new method for designing quantum search algorithms for finding a "marked" element in the 
state space of a classical Markov chain. Our algorithm expands the scope of earlier work due to Ambainis (H 
and Szegedy [24] and combines their advantages into a unified approach. An upshot is the improvement in 
various aspects of many known algorithms based on quantum walk. The method is also conceptually and 
technically simpler, and thus seems to demystify the role of quantum walks in search algorithms. 

Theorem[2]of Szegedy may be strengthened to give a quadratic relationship between a notion of quantum 
hitting time and the classical hitting time |[24l (cf. the remarks at the end of Section [T3T ). Our schema trades 
this relationship with classical hitting time for a form of run time that is more conducive to composition 
of algorithms. As a result, a direct application of our schema to search in special classes of graph such as 
the d-dimensional toroidal grid (for any constant d) leads to algorithms that are asymptotically slower than 
known ones 1121 H4j [24j . We believe that these results could be recast in our framework, but leave it open 
for the moment. 
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